Self-Annotation for fine-grained geospatial relation extraction
نویسندگان
چکیده
A great deal of information on the Web is represented in both textual and structured form. The structured form is machinereadable and can be used to augment the textual data. We call this augmentation – the annotation of texts with relations that are included in the structured data – self-annotation. In this paper, we introduce self-annotation as a new supervised learning approach for developing and implementing a system that extracts finegrained relations between entities. The main benefit of self-annotation is that it does not require manual labeling. The input of the learned model is a representation of the free text, its output structured relations. Thus, the model, once learned, can be applied to any arbitrary free text. We describe the challenges for the self-annotation process and give results for a sample relation extraction system. To deal with the challenge of finegrained relations, we implement and evaluate both shallow and deep linguistic analysis, focusing on German.
منابع مشابه
Fine-Grained Geographical Relation Extraction from Wikipedia
In this paper, we present work on enhancing the basic data resource of a context-aware system. First, we introduce a supervised approach to extracting geographical relations on a fine-grained level. Second, we present a novel way of using Wikipedia as a corpus based on self-annotation. A self-annotation is an automatically created high-quality annotation that can be used for training and evalua...
متن کاملFine-Grained Certainty Level Annotations Used for Coarser-Grained E-Health Scenarios - Certainty Classification of Diagnostic Statements in Swedish Clinical Text
An important task in information access methods is distinguishing factual information from speculative or negated information. Fine-grained certainty levels of diagnostic statements in Swedish clinical text are annotated in a corpus from a medical university hospital. The annotation model has two polarities (positive and negative) and three certainty levels. However, there are many e-health sce...
متن کاملExtending Fine-Grained Semantic Relation Classification to Presupposition Relations between Verbs
In contrast to typical semantic relations between verbs, such as antonymy, synonymy or hyponymy, presupposition is a lexical relation that is not very well covered in existing lexical resources. It is also understudied in the field of corpus-based methods of learning semantic relations. But presupposition is very important for the quality of automatic semantic and discourse analysis tasks. In t...
متن کاملVers une double annotation des Entités Nommées
The Named Entity Recognition task has reached, this last decade, an undeniable maturity. Research on Named Entity (NE) is now taking up new challenges with fine-grained annotation and disambiguation of named entities. In this article we present a method for named entity double annotation, combining information from an automatically constructed (semantic) lexical resource that provides semantic ...
متن کاملComparing human versus automatic feature extraction for fine-grained elementary readability assessment
Early primary children’s literature poses some interesting challenges for automated readability assessment: for example, teachers often use fine-grained reading leveling systems for determining appropriate books for children to read (many current systems approach readability assessment at a coarser whole grade level). In previous work (Ma et al., 2012), we suggested that the fine-grained assess...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2010